# PORTAL 



USPTO 



Subscribe (Full Service) Register (Limited Service, Free) Login 

Search: <• The ACM Digital Library C The Guide 
[♦fragment +delay +conflict +position +tile +token shading sh 



r 



Terms used 

fragment delay conflict position tile token shadin g shader 



Sort results | re | evance T[ ^ Save results to a Binder 

. r ^ Search Tips 

|expandedform r 0pen reS ults in a new 
window 



Feedback Report a problem Satisfaction 
survey 

Found 16 of 171,143 

Try an Advanced Search 

Try this search in The A CM Guide 



Display 
results 



Results 1 - 16 of 16 



Relevance scale UUHil 



1 Real- time shading 

Marc Olano, Kurt Akeley, John C Hart, Wolfgang Heidrich, Michael McCool, Jason L. Mitchell, 
^ Randi Rost 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes GRAPH 



•04 

Publisher: ACM Press 

Full text available:^ Pdf(7.39 MB) 



Additional Information: full citation , abstract 



Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real-time shading was possible, but only with one- 
of-a-kind hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of instructions. This course has been 
redesigned to address today's real-time shading capabili ... 



Realizing OpenGL: two implementations of one architecture 
Mark J. Kilgard 

August 1997 Proceedings of the ACM SIGGRAPH/ EUROGRAPHICS workshop on 
Graphics hardware 

Publisher: ACM Press 
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S pecial issue on knowled g e representation 

Ronald J. Brachman, Brian C. Smith 

February 1980 ACM SIGART Bulletin, issue 70 

Publisher: ACM Press 

Full text available: ^ pdf(13.13 MB) Additional Information: full citation , abstract 

In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter 
devoted to a survey of current knowledge representation research. We felt that there 
were twe useful functions such an issue could serve. First, we hoped to elicit a clear 
picture of how people working in this subdiscipline understand knowledge representation 
research, to illuminate the issues on which current research is focused, and to catalogue 
what approaches and techniques are currently being developed. Secon ... 
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A structural view of the Cedar p rogramming environment 

Daniel C. Swinehart, Polle T. Zellweger, Richard J. Beach, Robert B. Hagmann 

August 1986 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 8 Issue 4 
Publisher: ACM Press 

Full text available: ffl pdf( 6.32 MB) Additional Information: full citation , abstract, naferences, citings, index 
" terms 

This paper presents an overview of the Cedar programming environment, focusing on its 
overall structure— that is, the major components of Cedar and the way they are 
organized. Cedar supports the development of programs written in a single programming 
language, also called Cedar. Its primary purpose is to increase the productivity of 
programmers whose activities include experimental programming and the development of 
prototype software systems for a high-performance personal computer. T ... 

Ex ploitin g weak connectivity for mobile file access 
L. B. Mummert, M. R. Ebling, M. Satyanarayanan 

December 1995 ACM SIGOPS Operating Systems Review , Proceedings of the fifteenth 
ACM symposium on Operating systems principles SOSP '95, volume 29 
Issue 5 

Publisher: ACM Press 
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Dynamicall y scheduled VLIW processors 
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December 1993 Proceedings of the 26th annual international symposium on 
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Fortran 8X draft 
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Publisher: ACM Press 

Full text available: ^gpdf (21.36 MB) Additional Information: full citation , abstract , index term s 

Standard Programming Language Fortran. This standard specifies the form and 
establishes the interpretation of programs expressed in the Fortran language. It consists 
of the specification of the language Fortran. No subsets are specified in this standard. The 
previous standard, commonly known as "FORTRAN 77", is entirely contained within this 
standard, known as "Fortran 8x". Therefore, any standard-conforming FORTRAN 77 
program is standard conforming under this standard. New features can b ... 



Distributed file systems: concepts and examp les 
Eliezer Levy, Abraham Silberschatz 

December 1990 ACM Computing Surveys (CSUR), Volume 22 issue 4 
Publisher: ACM Press 
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The purpose of a distributed file system (DFS) is to allow users of physically distributed 
computers to share data and storage resources by using a common file system. A typical 



configuration for a DFS is a collection of workstations and mainframes connected by a 
local area network (LAN). A DFS is implemented as part of the operating system of each 
of the connected computers. This paper establishes a viewpoint that emphasizes the 
dispersed structure and decentralization of both data and con ... 

10 Dis tributed operatin g s ystems 
Andrew S. Tanenbaum, Robbert Van Renesse 

December 1985 ACM Computing Surveys (CSUR), Volume 17 issue 4 
Publisher: ACM Press 

Full text available: ffl pdf(5.49 MB) Additional Information: full citation , abstract, references , citings, index 

terms , review 

Distributed operating systems have many aspects in common with centralized ones, but 
they also differ in certain ways. This paper is intended as an introduction to distributed 
operating systems, and especially to current university research about them. After a 
discussion of what constitutes a distributed operating system and how it is distinguished 
from a computer network, various key design issues are discussed. Then several 
examples of current research projects are examined in some detail ... 

11 Fault-tolerance in the advanced automation s ystem 
Flaviu Cristian, Bob Dancey, Jon Dehn 

September 1990 Proceedings of the 4th workshop on ACM SIGOPS European 
workshop 

Publisher: ACM Press 

Full text available: ^)pdf (1.39 MB ) Additional Information: full citation , abstract , references , citings 

The Advanced Automation System is a distributed real-time system under development 
by IBM's Systems Integration Division for the US Federal Aviation Administration. The 
system is intended to replace the present en-route and terminal approach US air traffic 
control computer systems over the next decade. High availability of air traffic control 
services is an essential requirement of the system. This paper discusses the general 
approach to fault-tolerance adopted in AAS, by reviewing some of the q ... 



12 Automating the lexical and syntactic design of graphical user interfaces: the UofA* 
UIMS 

Gurminder Singh, Mark Green 
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13 PAMELA: a rule-based Al langua g e for process-control applications 
F. Barachini 

June 1988 Proceedings of the 1st international conference on Industrial and 

engineering applications of artificial intelligence and expert systems - 
Volume 2 IEA/AIE 88 

Publisher: ACM Press 
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14 S pecial section: Reasoning about structure, behavior and function 

B. Chandrasekaran, Rob Milne 
>/ July 1985 ACM SIGART Bulletin, Issue 93 

Publisher: ACM Press 

Full text available:^ pdf(5. 13 MB) Additional Information: full citation , abstract , references 

The last several years* of work in the area of knowledge-based systems has resulted in a 
deeper understanding of the potentials of the current generation of ideas, but more 
importantly, also about their limitations and the need for research both in a broader 
framework as well as in new directions. The following ideas seem to us to be worthy of 
note in this connection. 



15 Draft report on requirements for a common prototyping system 
yaw R. P. Gabriel 

>/ March 1989 ACM SIGPLAN Notices, Volume 24 Issue 3 
Publisher: ACM Press 
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16 Modeling for text compression 

Timothy Bell, Ian H. Witten, John G. Cleary 

December 1989 ACM Computing Surveys (CSUR), volume 21 issue 4 
Publisher: ACM Press 

Full text available* pdf(3,54 MB) Additional Information: full citation , abstract , references , citings, index 

terms , review 

The best schemes for text compression use large models to help them predict which 
characters will come next. The actual next characters are coded with respect to the 
prediction, resulting in compression of information. Models are best formed adaptively, 
based on the text seen so far. This paper surveys successful strategies for adaptive 
modeling that are suitable for use in practical text compression systems. The strategies 
fall into three main classes: finite-context modeling, i ... 
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July 2003 ACM Transactions on Graphics (TOG), volume 22 issue 3 
Publisher: ACM Press 

Full text available: ^ pdfd.67 MB) Additional Information: full citation , abstract, references , index terms 

In causal processes decisions do not depend on future data. Many well-known problems, 
such as occlusion culling, order-independent transparency and edge antialiasing cannot be 
properly solved using the traditional causal rendering architectures, because future data 
may change the interpretation of current events. We propose adding a delay stream 
between the vertex and pixel processing units. While a triangle resides in the delay 
stream, subsequent triangles generate occlusion information. ... 

Keywords: 3D graphics hardware, antialiasing, occlusion culling, order-independent 
transparency, stream processing 
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Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real-time shading was possible, but only with one- 
of-a-kind hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of instructions. This course has been 
redesigned to address today's real-time shading capabili ... 
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August 1997 Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on 
Graphics hardware 

Publisher: ACM Press 

Full text available: 'g pdfd.SS MB) Additional Information: full citation , references , citing s, index terms 



Keywords: 02, OpenGL, graphics hardware architecture, infinite-reality 



4 



S pecial issue on knowledge representation 

Ronald J. Brachman, Brian C. Smith 

February 1980 ACM SIGART Bulletin, issue 70 

Publisher: ACM Press 
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In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter 
devoted to a survey of current knowledge representation research. We felt that there 
were twe useful functions such an issue could serve. First, we hoped to elicit a clear 
picture of how people working in this subdiscipline understand knowledge representation 
research, to illuminate the issues on which current research is focused, and to catalogue 
what approaches and techniques are currently being developed. Secon ... 

A C haracterization of Ten Hidden-Surface Al g orithms 
Evan E. Sutherland, Robert F. Sproull, Robert A. Schumacker 
March 1974 ACM Computing Surveys (CSUR), Volume 6 issue l 

Publisher: ACM Press 
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I. Kadayif, M. Kandemir 

May 2005 ACM Transactions on Embedded Computing Systems (TECS), volume 4 issue 2 
Publisher: ACM Press 

Full text available: ^ pdf(978.44 KB) Additional Information: full citation , abstract , references , index terms 

Improving locality of data references is becoming increasingly important due to increasing 
gap between processor cycle times and off-chip memory access latencies. Improving data 
locality not only improves effective memory access time but also reduces memory system 
energy consumption due to data references. An optimizing compiler can play an important 
role in enhancing data locality in array-intensive embedded media applications with 
regular data access patterns.This paper presents a compiler-bas ... 

Keywords: Software compilation, array-intensive applications, data locality, iteration 
space tiling, scratch pad memory 
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This paper presents an overview of the Cedar programming environment, focusing on its 
overall structure— that is, the major components of Cedar and the way they are 
organized. Cedar supports the development of programs written in a single programming 
language, also called Cedar. Its primary purpose is to increase the productivity of 
programmers whose activities include experimental programming and the development of 
prototype software systems for a high-performance personal computer. T ... 
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Peter J. Denning 
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10 High -level synthesis of distributed logic-memory architectures 
Chao Huang, Srivaths Ravi, Anand Raghunathan, Niraj K. Jha 

November 2002 Proceedings of the 2002 IEEE/ACM international conference on 
Computer-aided design 

Publisher: ACM Press 

Full text available: ^ pdf(1.10 MB) Additional Information: full citation , abstract , references , index terms 

With the increasing cost of global communication on-chip, high-performance designs for 
data-intensive applications require architectures that distribute hardware resources 
(computing logic, memories, interconnect, etc.) throughout a chip, while restricting 
computations and communications to geographic proximities. In this paper, we present a 
methodology for high-level synthesis (HLS) of distributed logic-memory architectures, 
i.e., architectures that have logic and memory distribut ... 




11 Exploiting weak connectivity for mobile file access 
A. L B. Mummert, M. R. Ebling, M. Satyanarayanan 

December 1995 ACM SIGOPS Operating Systems Review , Proceedings of the fifteenth 

ACM symposium on Operating systems principles SOSP '95, volume 29 

Issue 5 

Publisher: ACM Press 

Full text available: 'Q pdfd .49 MB) Additional Information: full citation , r eference s, citings, index terms 



12 Que r y evaluation techniques for large databases 
a Goetz Graefe 

>/ June 1993 ACM Computing Surveys (CSUR), Volume 25 issue 2 
Publisher: ACM Press 

Full text available: ffl odf(9.37 MB) Additional lnformati ™ Mcitatipn, abstract, references, citings, index 
^ terms , review 

Database management systems will continue to manage large data volumes. Thus, 
efficient algorithms for accessing and manipulating large sets and sequences will be 
required to provide acceptable performance. The advent of object-oriented and extensible 
database systems will not solve this problem. On the contrary, modern data models 
exacerbate the problem: In order to manipulate large sets of complex objects as 
efficiently as today's database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
sort-hash duality 
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Publisher: ACM Press 
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terms , review 

Disk arrays were proposed in the 1980s as a way to use parallelism between multiple 
disks to improve aggregate I/O performance. Today they appear in the product lines of 
most major computer manufacturers. This article gives a comprehensive overview of disk 
arrays and provides a framework in which to organize current and future work. First, the 
article introduces disk technology and reviews the driving forces that have popularized 
disk arrays: performance and reliability. It discusses the tw ... 

Keywords: RAID, disk array, parallel I/O, redundancy, storage, striping 
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Keywords: VLIW processors, dynamic scheduling, multiple operation issue, out-of-order 
execution, scoreboarding 
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Milenko Drinic, Darko Kirovski, Hoi Vo 

March 2003 Proceedings of the international symposium on Code generation and 
optimization: feedback-directed and runtime optimization CGO '03 

Publisher: IEEE Computer Society 
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With the emergence of software delivery platforms such as Microsoft's .NET, reduced size 
of transmitted binaries has become a very important system parameter strongly affecting 
system performance. In this paper, we present two novel pre-processing steps for code 
compression that explore program binaries' syntax and semantics to achieve superior 
compression ratios. The first preprocessing step involves heuristic partitioning of a 
program binary into streams with high auto-correlation. The second ... 

17 Zero-cycle loads: microarchitecture support for reducin g load latenc y 
Todd M. Austin, Gurindar S. Sohi 

December 1995 Proceedings of the 28th annual international symposium on 
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Full text available: 'g pdfd.SS MB) Additional Information: full citation , references , citing s, index terms 
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June 2005 ACM SIGPLAN Notices , Proceedings of the 2005 ACM SIGPLAN conference 
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Issue 6 

Publisher: ACM Press 

Full text available: *g| pdf(347.77 KB) Additional Information: full citation , abstract , references , index terms 

Faults that occur in production systems are the most important faults to fix, but most 
production systems lack the debugging facilities present in development environments. 
TraceBack provides debugging information for production systems by providing execution 
history data about program problems (such as crashes, hangs, and exceptions). 



TraceBack supports features commonly found in production environments such as 
multiple threads, dynamically loaded modules, multiple source languages (e.g., Java 
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^ Peizong Lee, Zvi Meir Kedem 

January 2002 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 24 Issue 1 
Publisher: ACM Press 

Full text available:^ pdfd. 15 MB) Additional Information: full citation , abstract , references , index terms 

To exploit parallelism on shared memory parallel computers (SMPCs), it is natural to focus 
on decomposing the computation (mainly by distributing the iterations of the nested Do- 
Loops). In contrast, on distributed memory parallel computers (DMPCs), the 
decomposition of computation and the distribution of data must both be handled— in 
order to balance the computation load and to minimize the migration of data. We propose 
and validate experimentally a method for handling computations and data syn ... 

Keywords: Computation decomposition, data alignment, data distribution, distributed- 
memory computers, dominant data array, iteration space mapping vector, parallelizing 
compilers, spatial dependence vector, temporal dependence vector, tiling techniques 
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Publisher: ACM Press 
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Zebra is a network file system that increases throughput by striping file data across 
multiple servers. Rather than striping each file separately, Zebra forms all the new data 
from each client into a single stream, which It then stripes using an approach similar to a 
log -structured file system. This provides high performance for writes of small files as well 
as for reads and writes of large files. Zebra also writes parity information in each stripe in 
the style of RAID disk arrays; this increase ... 
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Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real-time shading was possible, but only with one- 
of-a-kind hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of instructions. This course has been 
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In this paper global patterns of thematic text organization are considered within the 
framework of a distributed model of text understanding. Based on the parsing results of 
prior text cohesion analysis, specialized text grammar modules determine whether some 
well-defined text macro-organization pattern is computable from the available text 
representation structures. The model underlying text coherence parsing formalizes hither- 
to entirely intuitive textlinguistic notions whose origin can be tra ... 
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importantly, also about their limitations and the need for research both in a broader 
framework as well as in new directions. The following ideas seem to us to be worthy of 
note in this connection. 

4 Draft report on reguirements for a common prototy ping system Q 
sfev R. P. Gabriel 

^ March 1989 ACM SIGPLAN Notices, Volume 24 issue 3 
Publisher: ACM Press 

Full text available: *Qpdf (4.76 MB) Additional Information: full citation , citings, index terms 




5 To ward relaxin g assumptions in lang uag es and their implementations 
<gv Mary Shaw, Wm. A. Wulf 

March 1980 ACM SIGPLAN Notices, volume 15 issue 3 

Publisher: ACM Press 

Full text available: ^| pdff1.47 MB) Additional Information: full citation , abstract , references , citings 
Language implementors frequently make pre-emptive decisions concerning the exact 



implementations of language features. These decisions constrain programmers' control 
over their computations and may tempt them to write involuted code to obtain special (or 
efficient) effects. In many cases, we can distinguish some properties of a language facility 
that are essential to the semantics and other properties that are incidental. Recent 
abstraction techniques emphasize dealing with such distinctions by ... 

6 Reducin g data cache leaka g e energy using a compiler-based approach 
Wei Zhang, Mahmut Kandemir, Mustafa Karakoy, Guangyu Chen 

August 2005 ACM Transactions on Embedded Computing Systems (TECS), volume 4 issue 
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Publisher: ACM Press 

Full text available: ^ pdf(750.57 KB) Additional Information: full citation , abstract , references , i ndex terms 

Silicon technology advances have made it possible to pack millions of transistors- 
switching at high clock speeds— on a single chip. While these advances bring 
unprecedented performance to electronic products, they also pose difficult power/energy 
consumption problems. For example, large number of transistors in dense on-chip cache 
memories consume significant static (leakage) power even if the cache is not used by the 
current computation. While previous compiler research studied code and data ... 

Keywords: Compiler analysis, array-intensive applications, data caches, energy 
optimization, pointer-intensive applications 
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Ty pe theories and object-oriented p ro grammim g 
Scott Danforth, Chris Tomlinson 
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Full text available* 1?) df(4 39 MB) Additional Information: full citation , abstract , references , citing s, index 
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Object-oriented programming is becoming a popular approach to the construction of 
complex software systems. Benefits of object orientation include support for modular 
design, code sharing, and extensibility. In order to make the most of these advantages, a 
type theory for objects and their interactions should be developed to aid checking and 
controlled derivation of programs and to support early binding of code bodies for 
efficiency. As a step in this direction, this paper surveys a number ... 

Desi gning complex systems — a structured activity 
Gerrit C. van der Veer, Johannes C. van Vliet, Bert F. Lenting 

August 1995 Proceedings of the conference on Designing interactive systems: 
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Publisher: ACM Press 
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>r November 2002 Proceedings of the 2002 IEEE/ACM international conference on 
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Publisher: ACM Press 

Full text available: Q pdf(156.76 KB) Additional Information: full citation , abstract , references , index terms 

While previous compiler research indicates that significant improvements in energy 
efficiency may be possible if properly optimized code is used, the energy constraints 
under which a given application code should be optimized may not always be available at 
compile-time. More importantly, these constraints may change dynamically during the 
course of execution. In this work, we present a dynamic recompilation/linking framework 
using which the energy behavior of a given application can be optimized ... 
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Barbara Liskov, Maurice Herlihy, Lucy Gilbert 



January 1986 Proceedings of the 13th ACM SIGACT-SIGPLAN symposium on 
Principles of programming languages 

Publisher: ACM Press 

Full text available: ^j pdf(1.48 MB ) Additional Information: full citation , abstract , references , citings 

Modules in a distributed program are active, communicating entities. A language for 
distributed programs must choose a set of communication primitives and a structure for 
processes. This paper examines one possible choice: synchronous communication 
primitives (such as rendez-vous or remote procedure call) in combination with modules 
that encompass a fixed number of processes (such as Ada tasks or UNIX processes). An 
analysis of the concurrency requirements of distributed programs suggests that t ... 



11 Regis ter allocation with instruction schedulin g 
Shiomit S. Pinter 

June 1993 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1993 conference 
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Publisher: ACM Press 
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Full text available: TO pdf(931.91 KB) a 
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We present a new framework in which considerations of both register allocation and 
instruction scheduling can be applied uniformly and simultaneously. In this framework an 
optimal coloring of a graph, called the parallel interference graph, provides an optimal 
register allocation and preserves the property that no false dependences are introduced, 
thus all the options for parallelism are kept for the scheduler to handle. For this 
framework we provide heuristics for trading ... 
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^ David Bernstein, Doron Cohen, Dror E. Maydan 
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Publisher: ACM Press 
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Memory disambiguation, or alias analysis, is a key component of modern optimizing 
compilers. Any optimization that reorders or changes code containing memory operations 
must analyze the memory references to ensure that the original semantics of the program 
are not changed.The recent proliferation of machines able to exploit parallelism, either at 
the coarse grain or more commonly at the instruction level, has led to the development of 
sophisticated memory disambiguation algorithm ... 
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Publisher: ACM Press 

Full text available: ^ pdf(7.39 MB) Additional Information: full citation , abstract 

Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real-time shading was possible, but only with one- 
of-a-kind hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of instructions. This course has been 
redesigned to address today's real-time shading capabili ... 



Realizin g O penGL: two implementations of one architecture 
Mark J. Kilgard 

August 1997 Proceedings of the ACM SIGGRAPH/ EUROGRAPHICS workshop on 
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Existing techniques for traversing a polygon generate fragments one (or more) rows or 
columns at a time. (A fragment is all the information needed to paint one pixel of the 
polygon.) This order is non-optimal for many operations. For example, most frame buffers 
are tiled into rectangular pages, and there is a cost associated with accessing a different 
page. Pixel processing is more efficient if all fragments of a polygon on one page are 
generated before any fragments on a different page. Si ... 

Keywords: graphics accelerators, rasterization, tiling 
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This paper presents a storage scheme which statically assigns pixel/texel coordinates to 
multiple memory banks in order to minimize frame buffer and texture memory access 
load imbalance. In this scheme, the pixels stored in a particular memory bank are placed 
at the center and the vertices of hexagons packed in the frame buffer By making these 
hexagons close to regular so that the pixel placement is uniform and isotropic, frame 
buffer and texture memory accesses are evenly distributed over the m ... 

9 Identif yin g and Exploiting S patial Regularity in Data Memory References 

Tushar Mohan, Bronis R. de Supinski, Sally A. McKee, Frank Mueller, Andy Yoo, Martin Schulz 
November 2003 Proceedings of the 2003 ACM/IEEE conference on Supercomputing 
Publisher: IEEE Computer Society 

Full text available: *g pdf(264.75 KB) Additional Information: full citation , abstract 

The growing processor/memory performance gap causes the performance of many codes 
to be limited by memory accesses. If known to exist in an application, strided memory 
accesses forming streams can be targeted by optimizations such as prefetching, 
relocation, remapping, and vector loads. Undetected, they can be a significant source of 
memory stalls in loops. Existing stream-detection mechanisms either require special 
hardware, which may not gather statistics for subsequent analysis, or are limite ... 




10 D ynamically scheduled VLIW processors 
B. Ramakrishna Rau 

December 1993 Proceedings of the 26th annual international symposium on 
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A relatively small set of static instructions has significant leverage on program execution 
performance. These problem instructions contribute a disproportionate number of cache 
misses and branch mispredictions because their behavior cannot be accurately anticipated 
using existing prefetching or branch prediction mechanisms. 

The behavior of many problem instructions can be predicted by executing a small code 
fragment called a speculative slice. If a speculative slice is exec ... 



12 Zero -c ycle loads: microarchitecture su p port for reducin g load latency 
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A constraint-based system, PLS, capable of deriving a technically acceptable three- 
dimensional layout for a chemical process plant of typical complexity is described. In the 
application of PLS to an exemplar plant, over five hundred instances of constraints were 
identified and manipulated. PLS has been implemented in a frame system created in- 
house. This features recursively structured frames to represent both the initial plant data 
and the intermediates derived by the system. Rea ... 



14 Eva l uatin g the impact of memory system performance on software prefetchin g and 
locality o ptimizations 

Abdel-Hameed A. Badawy, Aneesh Aggarwal, Donald Yeung, Chau-Wen Tseng 
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Software prefetching and locality optimizations are techniques for overcoming the speed 
gap between processor and memory. In this paper, we evaluate the impact of memory 



trends on the effectiveness of software prefetching and locality optimizations for three 
types of applications: regular scientific codes, irregular scientific codes, and pointer- 
chasing codes. We find for many applications, software prefetching outperforms locality 
optimizations when there is sufficient memory bandwidth, but ... 
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Matthew Eldridge, Homan Igehy, Pat Hanrahan 
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Pomegranate is a parallel hardware architecture for polygon rendering that provides 
scalable input bandwidth, triangle rate, pixel rate, texture memory and display bandwidth 
while maintaining an immediate-mode interface. The basic unit of scalability is a single 
graphics pipeline, and up to 64 such units may be combined. Pomegranate's scalability is 
achieved with a novel "sort-everywhere" architecture that distributes work in a balanced 
fashion at every stage of the pipeline, ke ... 
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The demand for high-performance architectures and powerful battery-operated mobile 
devices has accentuated the need for power optimization. While many power-oriented 
hardware optimization techniques have been proposed and incorporated in current 
systems, the increasingly critical power constraints have made it essential to look for 
software-level optimizations as well. The compiler can play a pivotal role in addressing the 
power constraints of a system as it wields a significant influence on the ... 
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In this paper I present a parser based on Description Logics (DL) for a German HPSG- 
Style fragment. The specified parser relies mainly on the inferential capabilities of the 
underlying DL system. Given a preferential default extension for DL disambiguation is 
achieved by choosing the parse containing a qualitatively minimal number of exceptions. 
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The gap between processor and main memory performance increases every year. In order 
to overcome this problem, cache memories are widely used. However, they are only 
effective when programs exhibit sufficient data locality. Compile-time program 
transformations can significantly improve the performance of the cache. To apply most of 
these transformations, the compiler requires a precise knowledge of the locality of the 
different sections of the code, both before and after being transformed. Cache ... 
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Excessive power consumption is widely considered as a major impediment to designing 
future microprocessors. With the continued scaling down of threshold voltages, the power 
consumed due to leaky memory cells in on-chip caches will constitute a significant portion 
of the processor's power budget. This work focuses on reducing the leakage energy 
consumed in the instruction cache using a compiler-directed approach. We present and 
analyze two compiler-based strategies termed as conservative and optim ... 
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